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Abstract. Model counting is the task of computing the number of as¬ 
signments to variables V that satisfy a given propositional theory F. 
The model counting problem is denoted as #SAT. Model counting is 
an essential tool in probabilistic reasoning. In this paper, we introduce 
the problem of model counting projected on a subset of original variables 
that we call priority variables PCV. The task is to compute the number 
of assignments to V such that there exists an extension to non-priority 
variables V \ V that satisfies F. We denote this as #3SAT. Projected 
model counting arises when some parts of the model are irrelevant to the 
counts, in particular when we require additional variables to model the 
problem we are counting in SAT. We discuss three different approaches 
to #3SAT (two of which are novel), and compare their performance on 
different benchmark problems. 


1 Introduction 

Model counting is the task of computing the number of models of a given proposi¬ 
tional theory, represented as a set of clauses (SAT). Often, instead of the original 
model count, we are interested in model count projected on a set of variables V. 

Given a problem on variables V , we may need to introduce additional vari¬ 
ables to encode the constraints on the variables V into Boolean clauses in the 
propositional theory F. Counting the models of F does not give the correct 
count if the new variables are not functionally defined by the original variables 
V. Thankfully, most methods of encoding constraints introduce new variables 
that are functionally defined by original variables, but there are cases where the 
most efficient encoding of constraints does not enjoy this property. Hence we 
should consider projected model counting for these kinds of problems. 

Alternatively, in the counting problem itself, we may only be interested in 
some of the variables involved in the problem. Unless the interesting variables 
functionally define the uninteresting variables, we need projected model count¬ 
ing. An example is in evaluating robustness of a given solution. The goal is to 
count the changes that can be made to a subset of variables in the solution such 
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that it still remains a solution (possibly after allowing some repairs, e.g. in su¬ 
permodels of a propositional theory na). The variables representing change are 
priority variables. In our benchmarks, we consider an example from the planning 
domain, where we are interested in robustness of a given partially ordered plan 
to the initial conditions, i.e., we want to count the number of initial states, such 
that the given partially ordered plan still reaches the given goal state(s). 

Projected model counting is a challenging problem that has received little 
attention. It is in #P NP . If all the variables are priority variables, then it be¬ 
comes a ff SAT problem (#P), and if all variables are non-priority variables, 
then it reduces to SAT (NP). There has been little development of special¬ 
ized algorithms for projected model counting in the literature. Some dedicated 
attempts at solving the problem are presented in m and |H]. In the latter, 
the primary motivation is solution enumeration, and not counting. Closely re¬ 
lated problems are projection or forgetting in formulas that are in deterministic 
decomposable negation normal form (d-DNNF [3]) [5], and Boolean quantifier 
elimination mm]. 

In this paper, we present three different approaches for projected model 
counting. 

— The first technique is straight-forward and its basic idea is to modify DPLL- 
based model counters to search first on the priority variables, followed by 
finding only a single solution for the remaining problem. This technique is 
not novel and has been proposed in m It has also been suggested in [I5j in a 
slightly different context. Unlike m which uses external calls to Minisat to 
check satisfiability of non-priority components, we handle all computations 
within the solver. 

— The second approach is a significant extension of the algorithm presented in 
0 . The basic idea is that every time a solution S is found, we generalize it by 
greedily finding a subset of literals S' that are sufficient to satisfy all clauses 
of the problem. By adding -i S' as a clause, we save an exponential amount 
of search that would visit all extensions of S'. This extension conveniently 
blends in the original algorithm of [H] , which has the property that the num¬ 
ber of blocking clauses are polynomial in the number of priority variables at 
any time during the search. 

— Our third technique is a novel idea which reuses model counting algorithms: 
computing the d-DNNF of the original problem, forgetting the non-priority 
variables in the d-DNNF, converting the resulting DNNF to CNF, and count¬ 
ing the models of this CNF. 

We compare these three techniques on different benchmarks to illustrate their 
strengths and weaknesses. 

2 Preliminaries 

We consider a finite set V of propositional variables. A literal l is a variable dGV 
or its negation ->n. The negation of a literal —is ->n if l = v or v if l = —>v. Let 




var(l) represent the variable of the literal, i.e., var(v) = var(-iv) = v. A clause 
is a set of literals that represents their disjunction, we shall write in parentheses 
(l i,..., l n ). For any formula (e.g. a clause) C, let vars(C) be the set of variables 
appearing in C. A formula F in conjunctive normal form (CNF) is a conjunction 
of clauses, and we represent it simply as a set of clauses. An assignment 9 is a set 
of literals, such that if l £ 6, then ->l (j 9. We shall write them using set notation. 
Given an assignment 9 then -> 9 is the clause \J lee Given an assignment 9 over 
V and set of variables P then 9p = {l \ l £ 9, var(l ) £ P} 

Given an assignment 9 , the residual of a CNF F w.r.t. 9 is written F\g and is 
obtained by removing each clause C in F such that there exists a literal l £ CC\9, 
and simplifying the remaining clauses by removing all literals from them whose 
negation is in 9. We say that an assignment 9 is a solution cube , or simply a 
cube, of F iff F\g is empty. The size of a cube 9 , size(9) is equal to 2l v Al e l. A 
solution in the classical sense is a cube of size 1. The model count of F, written, 
ct(F) is the number of solutions of F. 

We consider a set of priority variables PC V. Let the non-priority variables 
be J\f, i.e., TV" = V \ V. Given a cube 9' of formula F, then 9 = 9' v is a projected 
cube of F. The size of the projected cube is equal to 2 l' P r l e l. The projected 
model count of F, ct(F,V) is equal to the number of projected cubes of size 1. 
The projected model count can also be defined as the number of assignments 9 
s.t. vars(9) = V and there exists an assignment 9' s.t. vars{9') = J\f and 9 U 9' 
is a solution of F. 

A Boolean formula is in negation normal form (NNF) iff the only sub¬ 
formulas that have negation applied to them are propositional variables. An NNF 
formula is decomposable (DNNF) iff for all conjunctive formulae ci A • • • A c n in 
the formula, the sets of variables of conjuncts are pairwise disjoint, vars(ci ) PI 
var(cj) = 0,1 < i ^ j < n. Finally, a DNNF is deterministic (d-DNNF) if 
for all disjunctive formulae d\ V • ■ ■ V d n in the formula, the disjuncts are pair¬ 
wise logically inconsistent, di A dj is unsatisfiable, 1 < i j < n. A d-DNNF 
is typically represented as a tree or DAG with inner nodes and leaves being 
OR/AND operators and literals respectively. Model counting on d-DNNF can 
be performed in polynomial time (in d-DNNF size) by first computing the satis¬ 
faction probability and then multiplying the satisfaction probability with total 
number of assignments. Satisfaction probability can be computed by evaluating 
the arithmetic expression that we get by replacing each literal with 0.5, V with 
+ and A with x in the d-DNNF. 


3 Model Counting 


In this section we review two algorithms for model counting that are necessary 
for understanding the remainder of this paper. For a more complete treatment 
of model counting algorithms, see [12]. 


3.1 Solution enumeration using SAT solvers 

In traditional DPLL-algorithm [B], once a decision literal is retracted, it is guar¬ 
anteed that all search space extending the current assignment has been ex¬ 
hausted. Due to this, we can be certain that the search procedure is complete 
and does not miss any solution. This is not true, however, for modern SAT 
solvers m that use random restarts and First-UIP backjumping. In the latter, 
the search backtracks to the last point in search where the learned clause is 
asserting, and that might mean backjumping over valid solution space. It is not 
trivial to infer from the current state of the solver which solutions have already 
been seen and therefore, to prevent the search from finding an already visited 
solution 6 , SAT solvers add the blocking clause in the problem formulation 
as soon as 9 is found. 

3.2 DPLL-style model counting 

One of the most successful approaches for model counting extends the DPLL 
algorithm (see mm)- Such model counters borrow many useful features from 
SAT solvers such as nogood learning, watched literals and backjumping etc to 
prune parts of search that have no solution. However, they have three additional 
important optimizations that make them more efficient at model counting as 
compared to solution enumeration using a SAT solver. A key property of all 
these optimizations is that their implementation relies on actively maintaining 
the residual formula during the search. This requires visiting all clauses in the 
worst case at every node in the search tree. 

Say we are solving F and the current assignment is 9. The first optimization 
in model counting is cube detection; as soon as the residual is empty, we can stop 
the search and increment our model count by size(9). This avoids continuing the 
search to visit all extensions of the cube since all of them are solutions of F. 
The second optimization is caching [1] which reuses model counts of previously 
encountered sub-problems instead of solving them again as follows. Say we have 
computed the model count below 6 and it is equal to c, we store c against F\g. 
If, later in the search, our assignment is 9' and F\g = F\g> , then we can simply 
increment our count by c by looking up the residual. The third optimization 
is dynamic decomposition and it relies on the following property of Boolean 
formulas: given a formula G, if (clauses of) G can be split into Gi,..., G n such 
that vars(Gi) fl vars(Gj) = 0,1 < i ^ j < n, and (J, igl n vars(Gi) = vars(G ), 
then cf(G) = ct(G\) x ... x ct{G n ). Model counters use this property and split 
the residual into disjoint components and count the models of each component 
and multiply them to get the count of the residual. Furthermore, when used with 
caching, the count of each component is stored against it so that if a component 
appears again in the search, then we can retrieve its count instead of computing 
it again. 

4 Projected Model Counting 

In this section, we present three techniques for projected model counting. 




4.1 Restricting search to priority variables 


This algorithm works by slightly modifying the DPLL-based model counters as 
follows. First, when solving any component, we only allow search decisions on 
non-priority variables if the component does not have any priority variables. 
Second, if we find a cube for a component, then the size of that cube is equal to 
2 to the power of number of priority variables in the component. Finally, as soon 
as we find a cube for a component, we recursively mark all its parent components 
(components from earlier decision levels whose decomposition yielded the current 
component), that do not have any priority variables as solved. As a result, the 
count of 1 from the last component is propagated to all parent components whose 
clauses are exclusively on non-priority variables. Essentially, we store the fact 
that such components are satisfiable. 

Example 1. Consider the formula F with priority variables V = {p,q,r} and 
non-priority variables M = {x, y, z}. 

(-• 9 , + -■??), (-r, ->y, z), (r, -iz, ~^p), (z, y, ->p, r), (r , z, ~>y, =p), (p, q) 

Here is the trace of a possible execution using the algorithm in this subsection. 
We represent a component as a pair of (unfixed) variables and residual clauses. 

la. Decision p. The problem splits into C\ — ({y, x}, a;)}) and 

C 2 = ({r, y, z}, {(-ir, -iy, z), (r, ->z), (z, y, r), (r, 2 , ->y)}). 

2a. We solve C\ first. Decision -<q. We get C 3 = ((x},0) and ct(C 3 ,V) = 1 
(trivial), we backtrack to C\. 2b. Decision q, propagates x, and it is a solution. 
We backtrack and set ct{C\,V) = ct(C 3 ,V) + 1 = 2 . 

2c. Now, we solve C 2 . Decision r gives C 4 = ({y, 2 }, {(—>j/, 2 )}). 

3a. Decision 2 , we get C 5 = ((y},0) and ct{C*,,V) = 1. We backtrack to level 
C 2 setting cf (C4, V) = 1 since the last decision was a non-priority variable. 

2d. Decision ->r fails (propagates 2 , y, -> y). We set ct(C 2 ,V) = ct(Ci,V) = 1 
and backtrack to root F to try the other branch. 

lb. Decision —>p, propagates q and gives C@ = ({x}, 0) and C 7 = ({r, y, z}, {(-r, -1 y, 
We note that ct(Cs,V) = 1 (trivial) and move on to solve C 7 . 

2e. Decision ->r gives Cg = ({y}, 0) and Cg = ({ 2 }, 0) with counts 1 each. We go 
back to C 7 to try the other branch. 

2f. Decision r gives C 10 = ({y, 2 }, {(=y, 2 )}) which is the same as C 4 which has 
the count of 1. Therefore, cf (Cy, V) = ct{C%, V) x ct(Cg, V) + ct(C < 4 , V) = 2. All 
components are solved, and there are no more choices to be tried, we go back to 
root to get the final model count. 

A visualization of the search is shown in Figure [T] The overall count is 
ct(F,V) = ct{C\,V) x ct{C 2 , V) + ct{C 6 ,V) x ct{C 7 , V) = 4. □ 

4.2 Blocking seen solutions 

This approach extends the projected model counting algorithm given in 
which has been implemented in the ASP solver CLASP mi. The algorithm is 
originally for model enumeration, not model counting, and therefore, it suffers in 
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Fig. 1 : A visualization of the search tree for model counting with priority vari¬ 
ables. Nodes are marked with residual clauses and counts. Dotted edges indicate 
dynamic decomposition, dashed edged indicate backjumps over non-priority de¬ 
cisions. 


instances where there are small number of cubes, but the number of extensions 
of these cubes to solutions is large. We present a modification of the algorithm 
that does not have this shortcoming. But first, let us briefly summarize the 
motivation behind the algorithm and its technical details. 

The motivation presented in [S| is absence of any specialized algorithm in 
SAT (as well as ASP) for model enumeration on a projected set of variables, 
and the apparent flaws in the following two straight-forward approaches for 
model enumeration. The first is essentially the approach in ld.ll without dynamic 
decomposition, caching, and cube detection, i.e., to search on variables in V first 
and check for a satisfying extension over A f. This interference with the search 
can be exponentially more expensive in the worst case, although this approach is 
not compared against other methods in the experiment. The second approach is 
to keep track of solutions that have been found and for each explored solution 9, 
add the blocking clause ->6-p (this is also presented in |13j , although the algorithm 
restarts and calls Minisat by adding the clause each time a solution is found). 
In the worst case, the number of solutions can be exponential in \V\, and this 
approach, as experiments confirm, can quickly blow up in space. Note that, as 
opposed to the learned clauses which are redundant w.r.t. the original CNF and 
can be removed any time during the search, the blocking clauses need to be 
stored permanently, and cannot be removed naively. 

The algorithm of [ 8 ] runs in polynomial space and works as follows. At any 
given time during its execution, the search is divided into controlled and free 
search. The free part of the search runs as an ordinary modern DPLL-based 







SAT solver would run with backjumping, conflict-analysis etc. In the controlled 
part of the search, the decision literals are strictly on variables in V and how they 
are chosen is described shortly. Following the original convention, let bl represent 
the last level of controlled search space. Initially, it is equal to 0. Every time a 
solution 0 (with projection dp) is found, the search jumps back to bl, selects a 
literal x from dp that is unfixed (at bl), and forces it to be the next decision. It 
increments bl by 1, adds the blocking clause —i Op and most importantly, couples 
the blocking clause with the decision x in the sense that when we backtrack 
from x and try (force) —>x, ~^dp can be removed from memory as it is satisfied 
by —>x. Additionally, backtracking in the controlled region is provably designed 
to disallow skipping over any solution. Therefore, when we try —>x, all solutions 
under x will have been explored. Furthermore, with -<x, all subsequent blocking 
clauses that were added will have been satisfied since all of them include ~<x. 
This steady removal of clauses ensures that the number of blocking clauses at 
any given time is in 0 (\V\). 

We now describe how we extend the above algorithm by adding solution 
minimization to it. We keep a global solution count, initially set to 0. Once 
a solution d is found, we generalize (minimize) the solution as shown in the 
procedure shrink Figure [2j We start constructing the new solution cube S by 
adding all current decisions from 1 .. .bl. Then, for each clause in the problem 
(C in pseudo-code) and current blocking clauses ( B ), we intersect it with the 
current assignment. If the intersection contains a literal whose variable is in M or 
S, we skip the clause, otherwise, we add one priority literal from the intersection 
in S (we choose one with the highest frequency in the original CNF). After 
visiting all clauses, we use -■ S as a blocking clause instead of the one generated 
by the algorithm above ( _l ^p). Finally, we add to the global count. The 

rest of the algorithm remains the same. Note that the decision literals from the 
controlled part of the search are necessary to add in the cube, since the algorithm 
in 0 assumes that once a controlled decision is retracted, all the blocking clauses 
that were added below it are satisfied. This could be violated by our solution 
minimization if we do not add controlled decisions to S. 

Example 2. Consider the CNF in Example [T] Initially, the controlled search part 
is empty, B = 0 and bl = 0 as per the original algorithm. Say CLASP finds the 
solution: {p, -<q, x, z, r, ~<y}. shrink produces the generalized solution: S = {r,p} 
by parsing the clauses (r, z, ~>p) and (p, q) respectively (all other clauses can be 
satisfied by non-priority literals). We increment the model count by 2 (2 3_ l s l), 
store the blocking clause ~>S = (~<r,->p) and increment bl by 1. Say, we pick r, 
due to the added blocking clause, it propagates -<p, which propagates q. Say that 
CLASP now finds the solution {r, ->p, q, ->y, z, x}. In shrink, we start by including 
r in S since that is a forced decision, and then while parsing the clauses, we get 
S = {r, —ip, q}. Note that if we didn’t have to include the blocking clause (~<r, -> p), 
then we could get away with S = {r, q} which would be wrong since that shares 
the solution {r,q,p} with the previous cube. We increment the count to 3 and 
cannot force any other decision, so we try the decision -i r in the controlled part. 
At the same time, upon backtracking, we remove all blocking clauses from B , so 


% universal solution cube 


shrink(6>) 

S:= {} 

for (i £ 1... bl) 

S.add(dec(i )) % add decision to S 

for (c£CUB) 

/ := false 
for (Z € C) 

if (Z e 0) and (l £ Af ox l £ S) 
f := Zrae 

break 

if (/ = false) % if nothing makes the clause true already 

let p £ 9 Cl C : var(p ) £ "P % pick p with highest freq. 

S.add(p ) % add literal to cube 

ct := cZ + 2l p l-l s ' 

B .add(-<S) 


Fig. 2: Pseudo-code for shrinking a solution 6 of original clauses C and blocking 
clauses B to a solution cube S, adding its count and a blocking clause to prevent 
its reoccurrence. 


it is now empty. Say CLASP finds the solution {-■ r,-ip,q,x,~<y,z}, shrink gives 
S = {-ir, -i p, q}. We increment the count to 4, and when we add ~>S as a blocking 
clause, there are no more solutions under —>r. Therefore, our final count is 4. The 
visualization for this example is given in Figured □ 


4.3 Counting models of projected d-DNNF 

As mentioned in Section [2] it is possible to do model counting on d-DNNF 
in polynomial time (in the size of the d-DNNF), however, once we perform 
projection on V (or forgetting on J\T 0) by replacing all literals whose variables 
are in Af with true , the resulting logical formula is not deterministic anymore 
and model counting is no longer tractable (see [5]). 

In this approach, we first compute the d-DNNF of F, then project away the 
literals from the d-DNNF whose variables are in Af , convert this projected DNNF 
back to CNF, and then count the models of this CNF. The pseudo-code is given 
in Figure|U The conversion from d-DNNF to CNF is formalized in the procedure 
d2c, which takes as its input a d-DNNF (as a list of nodes Nodes ) and returns 
a CNF C. It is assumed that Nodes is topologically sorted, i.e., the children of 
all nodes appear before their parents. d2c maps nodes to literals in the output 
CNF with the dictionary UtAtNode. It also maps introduced (Tseitin) variables 
to expressions that they represent in a map UtWithHash. v represents the index 
of the next Tseitin variable to be created. d2c initializes its variables with the 
method init(). Next, it visits each node n, and checks its type. If it is a literal 
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Fig. 3: A visualization of counting models via blocking solutions. The curly arcs 
indicate free search, ending in a solution, with an associated count. The con¬ 
trolled search is indicated by full arcs, and blocking clauses associated with 
controlled search decisions are shown on arcs. 


and if it is a non-priority variable, then it is replaced with true (projected away), 
otherwise, the node is simply mapped to the literal. If n is an AND or an OR 
node, then we get corresponding literals of its children from the method simplify. 
We compute the hash to see if we can reuse some previous introduced variable 
instead of introducing a new one. If not, then we create a new variable through 
the method Tseitin which also posts the corresponding equivalence clauses in C. 
Finally, we post a clause that says that the literal for the root (which is the 
last node) should be true. The method simplify essentially maps all the children 
nodes to their literals. Furthermore, if one of the literals is true and the input 
is an OR-node, it returns a list containing a true literal. For an AND node, it 
filters all the true literals from the children. 

The next theorem shows that the method described in this section for pro¬ 
jected model counting is correct. 

Theorem 1 . ct(C) = ct(F,V) 

Proof (sketch). The entire algorithm transforms the theory from F to C by 
producing 2 auxiliary states: the d-DNNF of F (let us call it D) and the projec¬ 
tion of this d-DNNF (let us call this Dp). By definition, F and D are logically 
equivalent. On the other end, notice that the models of Dp and C are in one- 
to-one correspondence. Although the two are not logically equivalent due to the 
addition of Tseitin variables, it can be shown that these variables do not intro¬ 
duce any extra model nor eliminate any existing model since they are simply 
functional definitions of variables in V by construction (as a side note, the only 
reason for introducing these variables is to efficiently encode Dp as CNF, other¬ 
wise, C and Dp would be logically equivalent). Furthermore, we can show that 
the simplifications (replacing true V E with true and true A E with E) in the 
procedure simplify, and reusing Tseitin variables (through hashing) also do not 
affect the bijection. This just leaves us with the task of establishing bijection 
between the models of D and Dp , which, fortunately, has already been done in 
j4]. Theorem 9 in the paper says that replacing non-priority literals with true 



d2c( Nodes) 
initQ 

for (n E Nodes ) 

if (n is a literal Z) 
if ( var(l ) E A/") 

litAtNode[n] := true 
else HtAtNode[n] l 
elif (n = op(c i, . . . , c fc )) 

(Zi, . . . , lj) := simplify(n) 

if U = !) 

:= Zi 

else 

Zi : = hash(op, (Zi, . . . , lj)) 
if (litWithHash .hasKey(Zi)) 

ZitAtiVocZe[n] := litWithHash[h\ 
else 

v 7 := Tseitin(op, (Zi, . . . , Zj)) 
ZiM.ZiVocZe[n] := v 7 
lit WithHash [h] := v 7 
(7. add ({litAtNode [Nodes. lastQ]}) 
return C 


initQ 

C=(), litAtNode = {}, litWithHash — {} 

V := | V | 

simplify(op(ci, . . . , c fc )) 

L — 0 

for (c E ci, . . . , Cfc) 

if (ZiZAZiVocZe[c] = true) 

if (op — OR) return (true) 
else 

L.add( lit A tNode [c]) 

return L 

Tseitin(op, (Zi, . . . , lj)) 

Add clauses v -w* op(l±, . . . , lj) in C 
v := v + 1 

return v — 1 


Fig. 4: Pseudo-code for projected model counting via counting models of CNF 
encoding of projected DNNF. 


literals in a d-DNNF is a proper projection operation, and Lemma 3 establishes 
logical equivalence between D and Dp modulo variables in V . 


Example 3. Consider the formula F with priority variables p 1 q and non-priority 
variables x,y,z: 


h%,p), (q, (^p, -<y, —iz, q), ( x,q ), (-. q,p ) 

The projected model count is 2 ((p, q) and (p, -'(?)). 


V 



(a) d-DNNF Projected 


ai <=> q V ->q 
a2»fliAp 
a 3 <=?■ p A q 
a4 O a2 V as 

04 

(c) Formula from d2c 

d-DNNF 


Fig. 5: Example of application of d2c 


Figure [5] shows the initial d-DNNF (l5al) . the DNNF obtained by replacing 
all non-priority literals by true and simplifying © and the d 2 c translation of 
the projected DNNF (l5cl) . Notice that if we perform model counting naively on 




the projected DNNF, we get a count of 3 since we double count the model (p, q). 
The satisfaction probability is: 


,1 1 . 1 .1 1 , 
^2 + 2 ^ X 2 + ^2 X 2 ^ 


3 

4 


From satisfaction probability, we get the wrong model count 2 2 x | =3. However, 
if we count the models of the translated formula in (l5cl) , we get the correct count 
of 2. □ 


5 Experiments 

We compare the following solvers on various benchmarks: CLASP in its projection 
mode (clasp), our extension of clasp with cube minimization (=#=CLASP), model 
counting with searching on priority variables first (dSharp_P), and counting 
models of projected DNNF (d2c). In each row of the following tables, \V\ is the 
number of priority variables. T and D represent the execution time and number 
of decisions taken by the solver. I? is a parameter to gauge the quality of cubes 
computed by #CLASP, the higher it is, the better. It is equal to log 2 ( ^°^ es )• 
A value of 0 indicates that all solution cubes computed have size 1, while the 
maximum value is equal to the number of priority variables, which is the unique 
case when there is only one cube and every assignments to priority variables 
is a solution. R essentially quantifies the advantage over enumeration, the less 
constrained a problem is, and the more general the cubes are, the higher the 
advantage. S is the size (in bytes) of the CNF computed by d2c that is sub¬ 
sequently given to the solver SHARpSAT for model counting. The timeout for 
all experiments is 10 minutes. All times are shown in seconds. The experiments 
were run on NICTA’s HPC cluster. Q 


5.1 Uniform random 3-SAT and Boolean circuits 

Table [T| shows the results from uniform random 3-SAT and random Boolean 
circuits. In this table, for each problem instance, we show how the solvers perform 
as we increase the number of priority variables. A “... ” after a row means that 
every solver either ran out of time or memory for all subsequent number of 
priority variables until the next one shown. For each instance, a row is added 
that provides the following information about it: name, number of solutions as 
reported by dSharp, number of variables and clauses and time and decisions 
taken by dSharp. Note that this time should be added to the time of d2c in 
order to get the actual time of d2c approach. 

Let us look at the results form uniform random 3-SAT. All instances have 100 
variables, and the number of clauses is varied. We try clause-to-variable ratios of 
1, 1.5, 2, 3 and 4. Note that for model counting, the difficulty peaks at the ratio 

1 All benchmarks and solvers are available at: 

http://people.eng.unimelb.edu.au/pstuckey/countexists 




of approximately 1.5 n2j. For the first 3 instances, #CLASP is the clear winner 
while CLASP also does well, dSharp_P lags behind both, and d2c does not even 
work since the original instance cannot be solved by dSharp. For =#=CLASP, as 
we increase the number of clauses, the cube quality decreases due to the prob¬ 
lem becoming more constrained and cube minimization becoming less effective. 
For 300 clauses, we see a significant factor coming into play for dSharp_P. The 
original instance is solved by dSharp. As we increase the number of priority 
variables until nearly the middle, the performance of dSharp_P degrades, but 
after 50 priority variables, it starts getting better. This is because the degrada¬ 
tion due to searching on priority variables first becomes less significant and the 
search starts working more naturally in its VSADS mode m- d2c also solves 
two rows in this instance but is still largely crippled as compared to other solvers. 
Finally, with 400 clauses, we are well past the peak difficulty and the number 
of models is small enough to be enumerated efficiently by CLASP. All solvers 
finish all rows of this instance in less than .15 seconds. We tried the same ratios 
for 200 and 300 variables. For 200 variables, we saw the same trend, although 
the problem overall becomes harder and the number of solved rows decreases. 
For 300 variables, the problem becomes significantly harder to be considered a 
suitable benchmark. 

The Boolean circuits are generated with n variables as follows: we keep a set 
initialized with the n original variables, then as long as the set is not a singleton, 
we randomly pick an operator o (AND, OR, NOT), remove random operands 
V from the set, create a new variable v and post the constraints v <->■ o(V) and 
put v back in the set. The process is repeated c times. In the table, we show 
the results where n is 30, and c is 1,5,10. Note that a higher value of c means 
that the problem is more constrained. Overall, for all instances, dSharp _P is 
the superior approach, followed by CLASP; and d2c is better than #CLASP in 
c = 1 but the converse is true for higher values of c. All solvers find c = 5 to be 
the most difficult instance. We saw similar trends for different values of n that 
have appropriate hardness with same values of c. 


\V\ # 

CLASP 

T D 

#CLASP 

T DR 

dSharp _P 

T D 

d2c 

T D S 


UF #=— |V| = 100 |C| = 100 T =— D—— 

5 32 

10 1024 
15 32768 

25 2.7e+07 
35 1.8e+10 
50 1.9e+14 

0 2271 

.01 71309 

.40 2023146 
345.57 1.4e+09 

0 291 3.00 

0 533 7.19 

0 1888 10.30 

0 10584 17.36 

.02 62016 24.32 

107.75 7.1e+07 27.04 

.04 1150 

.99 35927 

7.92 370034 

— — — 




UF #= 

w= 

100 101=150 T —— D 

=- 



5 32 

0 1937 

0 

247 

3.00 

.03 1286 

— 

— 


10 1024 

.02 54077 

.01 

2933 

4.19 

.68 27142 

— 

— 

— 

15 32768 

.42 1767073 

0 

2101 

9.96 

31.63 1057551 

— 

— 

— 

25 2.1e+07 

270.91 8.7e+08 

.34 

393130 

11.25 

- - 

— 

— 

— 

35 2.8e+09 

— — 

24.98 

1.5e+07 

12.84 

— — 

— 

— 

— 






















UF #= 

=- w\= 

100 |C|=200 T —— D 

=- 



5 32 

0 1354 

0 

259 

2.42 

.04 1304 

— 

— 


10 1024 

.01 47771 

0 

1370 

5.25 

1.12 37596 

— 

— 

— 

15 30712 

.43 1408296 

0 

4659 

8.29 

37.07 874826 

— 

— 

— 

25 1.8e+07 

218.20 6.4e+08 

1.69 

1801261 

8.54 

— — 

— 

— 

— 



UF #= 

2.603e+ll |V| = 100 |C|=300 T=31.44 £>=571163 

5 32 

0 986 

0 646 0.75 

.05 671 

40.92 31 865K 

10 970 

.02 25441 

.02 11450 1.20 

2.07 14146 

102.63 969 1.6M 

15 12990 

.22 290973 

.11 61663 2.30 

12.74 144211 

- 6.7M 

25 226117 

3.84 3432170 

1.66 464908 3.21 

57.57 808670 

- 15M 

35 5126190 

49.02 6.6e+07 

15.65 3367386 4.67 

161.67 2834211 

- 38M 

65 1.6e+09 

- - 

- - - 

70.80 1552565 

- 89M 

75 2.0e+10 

- - 

- - - 

70.74 1330586 

— 104M 

85 2.9e+10 

- - 

- - - 

50.18 780597 

— 113M 

100 2.6e+ll 

— — 

— — — 

28.62 571163 

— 134M 


UF #—45868 |y| = 100 |C|=400 T=. 05 £> =244 


5 

7 

0 

1078 

0 

907 

0.49 

.08 

219 

.01 

6 

549 

10 

25 

0 

1308 

0 

1103 

0.94 

.14 

322 

.01 

17 

1.3K 

15 

32 

0 

1582 

0 

1242 

1.09 

.12 

376 

.01 

21 

3.IK 

25 

105 

.01 

2242 

0 

1290 

2.32 

.09 

373 

.01 

34 

3.4K 

35 

246 

.01 

3068 

0 

1338 

2.52 

.06 

363 

.01 

107 

8.6K 

50 

952 

.01 

6737 

.01 

2241 

3.24 

.05 

361 

.02 

202 

14K 

65 

3417 

.01 

16388 

.01 

2889 

4.41 

.05 

262 

.09 

321 

21K 

75 

7964 

.04 

26979 

.02 

2845 

5.32 

.05 

250 

.04 

426 

22K 

85 

13274 

.04 

36445 

.02 

3993 

5.18 

.05 

237 

.06 

563 

26K 

100 

45868 

.11 

46623 

.03 

4639 

6.74 

.04 

244 

.07 

688 

31K 



n=30 

0=1 # = 

-9.657e-j-08 \ V 

=99 |C| = 167 T 

=0 £>= 

111 


5 

16 

0 

409 

0 

292 

0.54 

0 

113 

.01 

4 

1.1K 

9 

160 

0 

3418 

0 

1184 

1.54 

0 

143 

0 

8 

1.2K 

14 

552 

0 

9305 

0 

5030 

1.00 

0 

82 

.01 

22 

3.8K 

24 

248960 

1.16 2718019 

1.49 

833682 

2.07 

0 

130 

.01 

169 

6.IK 

34 1621760 

6.13 

1.2e+07 

6.45 

1999088 

3.13 

.01 

111 

.05 

656 

9.7K 

49 3.9e+07 

104.26 

1.9e+08 

353.25 

1.4e+07 

4.78 

.01 

162 

.10 

1393 

14K 

64 

1.5e+08 

394.21 4.6e+08 

— 

— 

— 

0 

129 

.18 

2143 

18K 

74 4.4e+08 

— 

— 

— 

— 

— 

.01 

108 

.21 

2982 

20K 

84 7.2e+08 

— 

— 

— 

— 

— 

0 

99 

.20 

2624 

24K 

99 9.7e+08 

— 

— 

— 

— 

— 

.01 

111 

.20 

2517 

27K 


n= 

30 c=5 #=9.426e+07 \V\ 

=389 |C|=867 T=288.45 D= 

=1036363 

19 

12192 

.16 

155331 

.75 

146058 

0.00 

16.48 

120619 

— 

— 

5.8M 

38 

208716 

2.57 1882991 

15.23 

1985136 

0.00 

95.37 

834705 

— 

— 

47M 

58 

1.2e+07 

93.69 3.7e+07 

— 

— 

— 

— 

— 

— 

— 

100M 

97 3.3e+07 

248.76 6.9e+07 

— 

— 

— 

427.85 

1509308 

— 

— 

171M 

136 6.1e+07 

428.89 9.1e+07 

— 

— 

— 

— 

— 

— 

— 

252M 

291 

9.3e+07 

— 

— 

— 

— 

— 

300.78 

985065 

— 

— 

574M 

330 9.4e+07 

— 

— 

— 

— 

— 

299.02 

1074927 

— 

— 

672M 

389 9.4e+07 

— 

— 

— 

— 

— 

308.84 1036363 

— 

— 

783M 



n—30 

3=10 # 

=5066 \V 

=766 |C| = 1771 T=.32 £>=1400 


38 

282 

.01 

1196 

.03 

1797 

0.00 

.31 

1412 

.08 

256 

36K 

76 

1618 

.02 

3479 

.12 

5434 

0.00 

.40 

1600 

.81 

2046 

137K 

114 

2581 

.03 

4984 

.21 

7953 

0.00 

.52 

1787 

1.69 

3702 

173K 

191 

4948 

.05 

5558 

.52 

12243 

0.00 

.54 

1904 

4.98 

7519 

330K 

268 

5066 

.07 

5458 

.63 

12235 

0.00 

.38 

1784 

6.69 

10508 

478K 

383 

5066 

.09 

5513 

.85 

12356 

0.00 

.69 

1975 

9.27 12528 

698K 

497 

5066 

.08 

5253 

1.47 

12471 

0.00 

.39 

1680 

12.46 

11818 

1.1M 






































574 

5066 

.11 

5211 

1.68 

12358 

0.00 

.52 

1616 

14.85 

11911 

1.1M 

651 

5066 

.09 

5500 

1.21 

12546 

0.00 

.36 

1500 

13.42 

11807 

1.4M 

766 

5066 

.09 

5072 

2.06 

12389 

0.00 

.33 

1400 

21.61 

11644 

1.6M 


Table 1: Results from random uniform 3-SAT and Boolean circuits 


5.2 Planning 

Table [2] summarizes performance of different projected model counting algo¬ 
rithms on checking robustness of partially ordered plans to initial conditions. 
We take five planning benchmarks: depots, driver, rovers, logistics, and storage. 
For each benchmark, we have two variants, one with the goal state fixed and 
one where the goal is relaxed to be any viable goal (shown with a capital A in 
the table representing any goal). For the two variants, the priority variables are 
defined such that by doing projected model counting, we count the following. 
For the first problem, we count the number of initial states the given plan can 
achieve the given goal from. For the second problem, we count the number of 
initial states plus all goal configurations that the given plan works for. Each 
row in the table represents the summary of 10 instances of same size. The first 
3 columns show the instance parameters. For each solver, /shows how many 
instance the solver was able to finish within time and memory limits. All other 
solver parameters are averages over finished instances. Another difference from 
the previous table is that we have added the execution time of dSharp in d2c 
and dSharp time is shown in parenthesis. There was no case in which only 
dSharp finished and the remaining steps of d2c did not finish. 

Overall, dSharp _P solves the most instances (42), followed by #CLASP (41), 
d2c (34), and finally CLASP which solves only 4 instances from the storage 
benchmark, and otherwise suffers due to the inability to detect cubes. dSharp _P 
and d2c only fail on all instances in 2 benchmarks while #CLASP fails in 4, so 
they are more robust in that sense. For d2c, the running time is largely taken 
by producing the d-DNNF and the second round of model counting is relatively 
cheaper. The cube quality of #CLASP is quite significant for all instances that 
it solves. 


Instance 

CLASP 


#CLASP 


dSharp_P 


d2c 



Name |V| |C| \V\ 

STD 


T D 

R 

ST D 

s 

T 

D 

s 


depotsA 

9402 

211901 

224 

0 

— 

— 

0 

— — 

— 

1 

24.82 

92206 

1 

8.34 (6.98) 

1813 

154K 

depots 

9211 211796 

111.8 

0 

— 

— 

2 

4.16 4.43e+6 

31.54 

1 

24.74 

91909 

1 

7.71 (6.67) 

1642 

149K 

driverA 

2068 

12798 

135 

0 

— 

— 

0 

— — 

— 

5 

36.01 

27104.8 

3 

164.73 (161.42) 

68.33 

150.5K 

driver 

1999 

12700 

68 

0 

- 

— 

10 

0.31 1.23e+5 

51.7 

5 

15.8 

1.45e+4 

3 

118.67 (116.00) 

29.3 

109K 

logisticsA 

18972 324568 

447 

0 

— 

— 

0 

— — 

— 

0 

— 

— 

0 

— 

— 

— 

logistics 

18702 324352 

224 

0 

— 

— 

6 

33.52 1.81e6 

165.09 

0 

— 

— 

0 

— 

— 

— 

roversA 

3988 

27634 

209 

0 

— 

— 

0 

— — 

— 

5 

69.92 

51965 

3 

1.11 (1.06) 

53.33 

5.37K 

rovers 

3851 

27535 

104 

0 

— 

— 

10 

0.30 36769.7 

88.16 

5 

76.26 

52245.4 

3 

1.04 (1.01) 

12.67 

3.4K 

storageA 

915 

3465 

93 

1 

454.2 

2.5e9 

3 

43.81 3.89e7 

18.01 

10 

49.04 47112.50 

9 

103.35 (78.60) 

1964.2 440.21K 

storage 

851 

3420 

47 

3 

15.05 

7.87e7 

10 

0.05 30686 

30.47 

10 15.48 

12444 

9 

57.1 (53.46) 625.67 254.58K 


Table 2: Results from robustness of partially ordered plans to initial conditions. 

















6 Related Work and Conclusion 


The area of Boolean Quantifier Elimination (BQE) seems closely related to pro¬ 
jected model counting. Although the goal in BQE is to produce a non-priority 
variables free representation (usually CNF), some algorithms can also be adapted 
for projected model counting. Of particular interest are techniques in [3] and 
m- The algorithm of 0 finds cubes in decreasing (increasing) order of cube 
size (number of literals in cube). While this approach does not require cube 
minimization, it does not run in polynomial space, and if a problem only has 
large cubes, then significant time might be wasted searching for smaller ones. 

The second interesting approach, with promising results, is given in Ej, which 
uses a DPLL-style search and also decomposes the program at each step like the 
approach described in 14.II A good direction for future work is to investigate how 
well these techniques lend themselves to projected model counting and whether 
there is any room for integration with the ideas presented in this paper. 

In this paper we compare four algorithms for projected model counting. We 
see that each algorithm can be superior in appropriate circumstances: 

— When the number of solutions is small then CLASP [Bj is usually the best. 

— When the number of solution cubes is much smaller than solutions, and there 
is not much scope for component caching, then ^CLASP is the best. 

— When component caching and dynamic decomposition are useful then dShakp_P 
is the best. 

— Although d2c is competitive, it rarely outperforms both #CLASP and dSharp_P. 
Having said that, d2c approach has another important aspect besides pro¬ 
jected model counting. It is a method to perform projection on a d-DNNF 
without losing determinism. This can be done by computing the d-DNNF of 
the CNF produced by the d2c procedure (instead of model counting), and 
then simply forgetting the Tseitin variables (replacing with true). It can be 
shown that this operation preserves determinism. Furthermore, our exper¬ 
iments show that the last model counting step takes comparable time to 
computing the first d-DNNF in most cases (and in many cases, takes sig¬ 
nificantly less time), which means that the approach is an efficient way of 
performing projection on a d-DNNF. 

While we use d-DNNF for d2c approach, it is possible to use other, less suc¬ 
cinct, languages like Ordered Binary Decision Diagrams (OBDDs). We leave 
the comparison with other possible knowledge compilation-based approaches 
for projected model counting as future work. 

As the problem of projected model counting is not heavily explored, there is sig¬ 
nificant scope for improving algorithms for it. A simple improvement would be to 
portfolio approach to solving the problem, combining all four of the algorithms, 
to get something close to the best of each of them. 
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